CDS

Accession Number TCMCG078C01835
gbkey CDS
Protein Id KAG0449488.1
Location join(161057..161988,162376..162484,163293..163428,164020..164123)
Organism Vanilla planifolia
locus_tag HPP92_027335

Protein

Length 426aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA633886, BioSample:SAMN14973820
db_source JADCNL010000184.1
Definition hypothetical protein HPP92_027335 [Vanilla planifolia]
Locus_tag HPP92_027335

EGGNOG-MAPPER Annotation

COG_category M
Description Glycosyltransferase family 28 C-terminal domain
KEGG_TC -
KEGG_Module -
KEGG_Reaction R05032, R05662
KEGG_rclass RC00005, RC00049
BRITE ko00000, ko00001, ko01000, ko01011
KEGG_ko ko:K02563
EC 2.4.1.227
KEGG_Pathway ko00550, ko01100, ko01502, ko04112, map00550, map01100, map01502, map04112
GOs -

Sequence

CDS:  
ATGGCCGCCGTGCCTGCTGGTGTTTCCTTGGCCAACTCACCGAGGCATAATAGGCTTTCTCGGTGCCTTTTTACTTCTCAAGGGAGAGCAAATTCTGTGGTGAAGTTCCGAGCTTTGCAGCTCCGGCTTAATGCTGTCGCCGCTGAAAGCTCCACTGGTTTATGCGTGGTATTGGCAGGAGGTGGCTCCGGCGGTCATATCTTCCCCGCCCTCGCCATCGCTGATGAGATCCGTAATGTTCGTCCCGATGCCCGACTCGTCTTCCTCGGGACATCAACCGGTATGGAGCGGAATGTCATCCCGTCCGCTGGCTACGGCTTTGTCCCTGTCCCCAAGGTTCGCCTTTCCCGCCCCTTTCTCTCCCCTCTCAACCTCCTTTTCCCCTTCCGGCTCCTCCACTCCGTCGCCGCCAGCACCGCCATCCTCTCCCGTCTCAGACCCCAAGTAGTCGTCGGCACCGGGGCCTACGTCTCAGCCCCCGTTTGCTTCGCTGCCATCATATCTGGTATCAAGCTCGTTATTCAGGAGCAGAATTGTTTTCCCGGGATCTCCAACCGTGCTCTGGCCCCCTATGCTGAGAAGATTTTTCTCGCCTTCAATGCTTGCATCAAGTACTTCCCCAAAGAGAAATGTGTAGTCTGTGGGAACCCTCGCCGGCTCAGTGGTGGCAAAGGGGATGAAGTTCGCAAGAAGGAAGCTGTGTTGCATTTCTTTCCCAAATCTGATGACTTGGCCATGGACGAGAGAGCTCATGTGGTTCTTGTGCTGGGCGGTTCAACTGGAGCCAATGCCTTGAATCATGCTTTCATGGAGTTCTGCTATGAAATGCTCATGGAACATAAGAACAGTTTCATTATATGGCAGACGGGGGAAGAGTGGTTTGAGGAGGTCAAAAGAAGTCACAAGGCTCATCCCAGATTGTTGATTATTCCGTTTTTGGAAGAAATGGAATTAGCTTATACAGCTGCTGATCTTGTTGTAACTCGAGCTGGAGCAATGACGTGCACAGAAATACTAACAACGGGGAAGCCTTCTATTCTGATACCATCACCAACAGCGACTGATGATCACCAAACAAAAAATGCATATGCCATGGCAGACTTAGCCGGATCCATAGTTCTAACAGAAGATGAGCTTAATTCCAGTAGTCTGCAGACAGCCATCAATAGTGTGTTAGGTGACGATAAGTTGATGGAAGAAATGTCAGATAAGGCAAGGAGAGCTGCTAGACCTCATGCTGCTTCTTATATTGCAGAAAGCATTCTCTCCCTTTTAGATTAG
Protein:  
MAAVPAGVSLANSPRHNRLSRCLFTSQGRANSVVKFRALQLRLNAVAAESSTGLCVVLAGGGSGGHIFPALAIADEIRNVRPDARLVFLGTSTGMERNVIPSAGYGFVPVPKVRLSRPFLSPLNLLFPFRLLHSVAASTAILSRLRPQVVVGTGAYVSAPVCFAAIISGIKLVIQEQNCFPGISNRALAPYAEKIFLAFNACIKYFPKEKCVVCGNPRRLSGGKGDEVRKKEAVLHFFPKSDDLAMDERAHVVLVLGGSTGANALNHAFMEFCYEMLMEHKNSFIIWQTGEEWFEEVKRSHKAHPRLLIIPFLEEMELAYTAADLVVTRAGAMTCTEILTTGKPSILIPSPTATDDHQTKNAYAMADLAGSIVLTEDELNSSSLQTAINSVLGDDKLMEEMSDKARRAARPHAASYIAESILSLLD